FODAVA-Partner: Visualizing Audio for Anomaly Detection

ثبت نشده
چکیده

“Most people who handle money a lot (i.e. cashiers) can identify a lower-quality fake bill instantly just by touching it” [10]. Data analysts are like cashiers: a trained data analyst can detect anomalies “at a glance” when data is appropriately transformed. This is the goal of data visualization. This proposal addresses the type of audio anomalies that human data analysts hear instantly: angry shouting, trucks at midnight on a residential street, gunshots. The human ear detects anomalies of this type rapidly and with high accuracy. For example, rifle magazine insertion clicks are detected with 100% accuracy at 0 dB SNR in white noise, babble, or jungle noise [1]. Unfortunately, a data analyst can listen to only one sound at a time. Visualization shows the analyst many sounds at once, possibly allowing him or her to detect an anomaly several orders of magnitude faster than “real time.” This proposal aims to to render large audio data sets, comprising thousands of microphones or thousands of minutes, in the form of interactive graphics that reveal important anomalies at a glance. Precedents for such graphical rendering are familiar to audio professionals. A simple amplitudeversus-time graph reveals silences for a speech transcriber to skip past; a spectrogram reveals details of birdsong to an ornithologist. But many audio anomalies are not so easy to display: angry shouts versus enthusiastically shouted greetings; the clatter of overturned tables rather than mere dishwashing; spoken Thai in Berlin, or German in Bangkok. All these cases can submit to automatic anomaly detection, using probabilistic models of longand short-term spectral features of normal activity. Unfortunately, the state of the art in automatic audio event detection is not very accurate [112]. We propose to represent audio anomaly to the analyst using a type of overcomplete lossless encoding: automatic anomaly salience scores will be displayed together with raw audio features, allowing a human analyst to drill down at any point in the data in order to resolve discrepancies in the visible display. The goal of this proposal is to present to analysts a coherent visual summary of both probabilistic and raw spectral information. The measurable outcome of this research will be the speed with which analysts find audio anomalies that have been planted, by the experimenter, in a very large dataset. A successful research outcome will be a visual summary that lets the analyst detect most anomalies immediately (about 10,000× faster than real time), and all anomalies after brief interactive exploration (about 1,000× faster than real time). In short, the goal of this proposal is to transform, model, and reduce data for efficient effective visualization and analytic reasoning: • A simple time series is transformed into audio features and probabilistic model-based features, vastly reducing the quantity of data presented at one time to the analyst (fewer observations). • Multiple techniques of dimensionality reduction condense the breadth of the data presented to the analyst (fewer variables). • Computationally inexpensive multiscale caching of all layers, from summary variables down to the raw source data, supports interactive investigation of hypotheses at different spatial and temporal scales. • The interactive visualizations are efficient: caching increases the analyst’s decision rate. • The visualizations are effective: the decisions have a measurably low error rate.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

3D Gabor Based Hyperspectral Anomaly Detection

Hyperspectral anomaly detection is one of the main challenging topics in both military and civilian fields. The spectral information contained in a hyperspectral cube provides a high ability for anomaly detection. In addition, the costly spatial information of adjacent pixels such as texture can also improve the discrimination between anomalous targets and background. Most studies miss the wort...

متن کامل

Moving dispersion method for statistical anomaly detection in intrusion detection systems

A unified method for statistical anomaly detection in intrusion detection systems is theoretically introduced. It is based on estimating a dispersion measure of numerical or symbolic data on successive moving windows in time and finding the times when a relative change of the dispersion measure is significant. Appropriate dispersion measures, relative differences, moving windows, as well as tec...

متن کامل

Visualizing Real-Time Network Resource Usage

We present NetGrok, a tool for visualizing computer network usage in real-time. NetGrok combines well-known information visualization techniques—overview, zoom & filter, details on demand—with network graph and treemap visualizations. NetGrok integrates these tools with a shared data store that can read PCAP-formatted network traces, capture traces from a live interface, and filter the data set...

متن کامل

Assessment Methodology for Anomaly-Based Intrusion Detection in Cloud Computing

Cloud computing has become an attractive target for attackers as the mainstream technologies in the cloud, such as the virtualization and multitenancy, permit multiple users to utilize the same physical resource, thereby posing the so-called problem of internal facing security. Moreover, the traditional network-based intrusion detection systems (IDSs) are ineffective to be deployed in the cloud...

متن کامل

Improving the RX Anomaly Detection Algorithm for Hyperspectral Images using FFT

Anomaly Detection (AD) has recently become an important application of target detection in hyperspectral images. The Reed-Xialoi (RX) is the most widely used AD algorithm that suffers from “small sample size” problem. The best solution for this problem is to use Dimensionality Reduction (DR) techniques as a pre-processing step for RX detector. Using this method not only improves the detection p...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2007